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SECTION I 

BEST copy AVAItX8tE 

Introduction 

With the advent of an all-volunteer recruitment policy, the 
United States Air Force is replacing automated assignment of enlisted 
recruits to career fields with a guaranteed assignment program. Until 
recently, the majority of Air Force enlistees have been assigned to 
career fields by a computer assignment system. Early in basic training 
they were given an overview and audio -visual presentation of the fields 
available to them and asked to indicate their choices. The assignments, 
made by computer at the conclusion of basic training, used variables 
such as the number of persons needed in each field and the preferences 
expressed by the enlistees. 

In 1971, USAF recruiters began offering, prior to enlistment, 
guaranteed assignment in any one of 26 Air Force career fields. Since 
then, the program has expanded so that now, in 1973, there are 
approximately 132 career fields from which an enlistee may choose. 

Should this program expand even further, the responsibilities 
traditionally assumed by recruiters would be altered considerably. 
In addition to promoting the Air Force as a career, recruiters would 
be placed in the role of vocational counselors, guiding rec?:uits to 
potentially far-reaching commitments. The implications for reenlistment 
are obvious. 

However, to guide men into rewarding careers, the recruiters will 
need either a much broader background in counseling than most of them 
now have or effective devices for measuring the vocational interests 
of recruits. Vocational interest inventories, or questionnaires, are 
used extensively for this purpose by high school and college counselors. 
Unfortunately they are not appropriate for Air Force personnel. The 
questions, or items, do not reflect the nature of the activities of 
an enlisted man on duty. Furthermore, degree-of-job-satisf action is 
not a factor in the interest scales, or scores, of the inventories 
used in schools and colleges. These scales reflect either the 
respondent's membership in an occupational group or his response to 
groups of homogeneous item^. 

The Air Force needs a vocational-interest inventory for use in 
conjunction with their aptitudinal selection measures. The inventory 
should be designed specifically for Air Force recruiters who are 
administering a guaranteed assignment program. This report describes 
the initial stages of the development of such an instrument called 
the Vocational and Occupational Interest Choice Examination (VOICE) . 

A Brief Review of Vocational Interest Measures 



Vocational interest measurement has one of the oldest and most 
successful records in the history of psychological testing. Two 



excellent suitimaries and evaluations of the published inventories can 
be found in Euros (1965) and Robinson, Athanasiou, and Head (1969). 
Much of this section is based upon Chapter 13 of Robinson et al, where 
concise evaluations of 13 interest inventories, also reviewed in 
Euros, are presented in addition to a review of one developed since 
Euros' publication. In this section, some of the difficulties and 
shortcomings of many interest inventories that render their use by 
the Air Force inappropriate are enumerated. The aim is not to 
unjustly criticize these inventories, because many of them, especially 
the Strong and Kuder, have been used with considerable success in 
guidance programs but to explain why VOICE was developed for the 
Air Force's recruiting programs. 

The Strong Vocational Int erest Blank is one of the most highly 
regarded and well-researched inventories available. However, it has 
some psychometric problems, such as out-of-date norms and possible 
response set. Although the scale authors are continually making 
improvements, there is one problem that renders the Strong inapplicable 
for the Air Force; almost all the items and scales refer to occupations 
at the top fifth of the occupational status hierarchy. Very few refer 
to nonprofessional occupations, which largely make up the choices 
available to the enlisted man. Furthermore, the number of scales 
(112 on one form) makes interpreting a profile complex; and scoring 
the scales by hand is a tedious process, although a computerized 
scoring system could minimize this problem. 

The Kuder Preference Record — Occupational and Vocational are two 
highly rated and frequently used inventories. The occupational form 
was designed to compete with the Strong. The chief criticism of the 
Kuder forms has been against the interpretations of results based on 
the forced choice method of responding. Euros points out the 
difficulties of interpreting results in either a normative or an 
ipsiuive mode. It is believed that recruiters would have considerable 
difficulty making accurate and useful interpretations with either form. 

One recently published inventory which might have been considered 
for use by the Air Force is the Minnesota Vocational Interest Inventory. 
This inventory was designed to measure the nonprofessional occupational 
interests of men who enlist in the Navy. Development of the Minnesota 
Inventory, funded by the Office of Naval Research, began in 1946. In 
its evolution, the inventory moved from a military setting to a 
nonprofessional civilian setting. The item format is the same as that 
used by the Kuder and suffers from the same limitations created by 
the use of the forced choice format. Eoth occupational and homogeneous 
scales are presented. Euros does not review the Minnesota Inventory, 
but Robinson et al give it very favorable treatment. 

The Gordon Occupational Checklist was also designed for use in 
measuring nonprofessional vocational interests. It has been criticized 
because scores are given on a priori scales, with only an afterthought 
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for item analysis. Also, there is some difficulty with the definitions 
of the scales, as well as with the underlying factor structure. 

Other inventories that could be considered are the Picture Interest 
Inventory, Guilford-Zimmernian Interest Inventory, How-Well-Do-You-Know- 
Your- Interests, Geist Picture Interest Inventory, Curtis Interest Scale, 
Fowler-Permeuter Interest Record, Career-Finder, and Qualifications 
Record. All of them have numerous shortcomings: most frequently poor 
item construction and analysis, potential response set, statistical 
problems, and psychometric inadequacies of reliability, homogeneity, 
discrimination, and cross-validation. 
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SECTION II 



Development of Instruments 

Career Field Selection 

Eight career fields were selected to serve as a basis for developing 
the interest inventory. Each career field was in the guaranteed 
assignment program as of 1 May 1972, had a high inward flow of personnel, 
and was selected so that it represented two career fields from each 
aptitude index of the Airman Qualifying Examination (AQE) — Mechanical, 
Administrative, Electronics, and General, (The Airman Qualifying 
Examination was the operational selection and classification instrument 
used by the Air Force during the time period of the contract.) 

Since it was desirable to select from each AQE aptitude index area 
one career field w>th a requirement of a high and one career field with 
a low selector aptitude index all career fields with selector aptitude 
indexes of at least 80 in Electronics, 60 in General and Administrative, 
and 50 in Mechanical areas were reviewed along with career fields 
requiring a selector aptitude index of 40 or less in Administrative, 
General, and Mechanical and 50 or less in Electronics. 

FL'om each group, career fields with the largest Ns were selected 
for consideration. Brief descriptions of tl ese fields were examined and 
on the basis of the magnitude of N, similarity, and judgement, the 
eight career fields given in Table 1 were selected. 

AQE score distributions for each aptitude type and job classification 
in each field were requested so the fields could be compared for ovarlap 
in AQE scores. No significant overlap was found for career fields in 
the same area. 

Development of VOICE 

Construction of the interest scales began with an inventory of the 
ETS test collection and a review of all relevant occupational interest 
inventories. From these sources an item pool of 400 occupational 
interest items were written. It was decided to assign related items to 
separate interest scales or groups, for scoring purposes, for example, 
mechanical, computational, clerical and so forth. An a priori scale 
approach was planned for two reasons. First, the significance of 
response to single items is not very reliable. Second, the scale 
definitions would provide guidelines for writing items if alternate or 
parallel forms of the inventory were needed. Since occupational scales 
were to be developed, a pool of miscellaneous items was also written. 

The items were based on two principal criteria: (1) each item 
would be assigned to one of 14 scales thought to be relevant to Air 
Force career fields (with the exception of the items assigned to the 
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miscellaneous category) and (2) the language of the items should be 
appropriate for the probable reading level of recruited Air Force 
personnel. 

Each item presents an activity (e^g., "Tinker with a broken sewing 
machine") and asks the examinee to indicate whether he likes, dislikes, 
or is indifferent to the activity. The items were listed singly, not 
in groups. This format was chosen for its simplicity and efficiency. 
Zuckerman (1953) showed that individual item arrangement is more efficient 
than triad arrangement in terms of response time. Perry (1^53) conducted 
another study in which the item pools were of equal size and triads were 
compared with individual items. Although a slight superiority was 
shown for the triads, it was not consistent. Items arranged individually 
also have the advantage, of being easier to score. 

The scales originally developed resulted from a thorough review of 
the major occupational interest inventories extant and adaptations of 
them which were related to the Air Force career fields. In adlitxon, 
a review of the jobs within the various career fields suggested other 
scales which had no counterparts in other published inventories. 

\4hen the items were being written, two reference sources were used. 
One was AFM 39-1 which contained specific descriptions of Air Force 
jobs. The second was the Dictionary of Occupational Titles (DOT) (1969). 
It was thought that candidates for the Air Force would probably be more 
familiar with the nonmilitary counterparts of Air Force jobs, so that, 
where possible, the DOT descriptions of jobs similar to those described 
in AFM 39-1 were also used as source material. 

The final version of the interest inventory consisted of 400 items 
grouped in four general sections: Occupations (90 items), Work 
Tasks (210 items), Leisure Activities (70 items), and Desired Learning 
Experiences (30 items). The final version of the inventory prototype 
contained 13 separate scales in addition to a pool of miscellaneous 
items. The scales and the number of items in each scale are listed 
in Table 2. 

Development of Job Satisfaction Scales 

Prior to developing a job satisfaction scale, a search was conducted 
for existing job satisfaction scales and research dealing with them. 
The 13 scales given by Robinson et al (1969, Chapter 5) were examined 
in considerable detail. It was concluded that there are several factors 
that may affect job satisfaction. In general, the factors could be 
characterized as either extrinsic or intrinsic. In order to include 
each significant factor, a draft scale of 47 items was developed. 

From these items, four job satisfaction scales were developed and 
titled "Job," "Peer," "Supervision," and "Air Force." The 16-ltem Job 
scale was designed to measure intrinsic satisfaction with the actual 
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Table 2 

Final Interest Scales and Items in Each 



Scales Items 

Audiographic 18 

Food Service 18 

Pedagogy 12 

M-Scale 20 

Leadership 13 

Computational 26 

Health Service 28 

Scientific 34 

Electronic 31 

Mechanics 40 

Clerical 40 

Outdoors 33 

Academic 49 

Miscellaneous 77 
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work activities. The 10-item Peer scale was designed to measure 
satisfaction with one's fellow co-workers. The Supervision scale 
contained 12 items and measured satisfaction with one's supervisor. 
The Air Force scale was a brief 8-item scale aiming to measure 
satisfaction with working conditions in the Air Force. Items were 
written in brief form, often consisting of only one word. Items 
were also stated in such a way that positive response to some items 
would :'*.ndicate satisfaction and to other items dissatisfaction. 

The four job satisfaction scales are modifications of scales used 
in the Job Description Index (JDI) of Locke, Smith and Hulin (1965). 
An approach similar to that used in the JDI was considered appropriate 
because of its low verbal level and research indicating that the JDI 
scales have predictive, convergent, and discriminant validity ^ as well 
as internal consistency and stability (Robinson et al, 1969, pp. 105-107). 
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SECTION III 



Field Test 

Sample Selection 

VOICE was administered in a field test to a sample consisting 
of 4,800 airmen, 600 from each of the eight selected career fields. 
To be in the sampling frame for a given career field, an airman needed 
to have both AQE scores on file and at least six, but not more than 42, 
months of on-the-job experience. A random sample of personnel within 
each career field was selected from the airman tape files maintained 
by the Computational Sciences Division of the Air Force Human Resources 
Laboratory. 

In addition, the following information was obtained and forwarded to 
Educatior/al Testing Service (ETS) on each participant: his social security 
account number (SSAl^) , the Consolidated Base Personnel Office (CBPO) to 
which he was assigned, his AF speciality code and aptitude scores. 

The Role of the CBPO 

The participants in the field test were located at 128 CBPOs 
throughout the world. Each participating CBPO was notified by 
Headquarters USAF that they would be receiving an approved interest 
survey which should be distributed to identified personnel. Initial and 
follow-up mailings to the CBPOs, by the contractor, contained a letter 
from the Personnel Research Division outlining the project, instructions 
for survey administration, a roster of the participating airmen, and 
preaddressed sealed packets for distribution to the men. Each sealed 
packet contained a VOICE booklet, an answer* sheet, an introductory 
letter, a pencil, and a business reply envelope. The men were told 
to return their completed VOICE booklets directly to the ETS project 
directors. 

In addition, CBPOs in the continental United States whose bases 
had a large nonresponse rate were contacted by the Personnel Research 
Division of the Air Force Human Resources Laboratory. The response 
rate from those bases that were called increased significantly, 
particularly from those with large numbers of security specialist 
nonrespondents - 

Collection of the Completed Inventories 

As the completed inventories were received, they were checked for 
accuracy of identification information and omission of data. 

Inventories were collected until 3,537 had been returned. Of those 
returned, 3,104 were found to be useable. Of this total, 3,072 were 
found to be valid for analysis purposes after detailed scan-edit 
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procedures. All AFSCs had more than 300 usable returns, which was 
considered the miniraum number required for analysis. A complete 
t^'lly of the usable and unusable returns by career field and reason 
for nonuse appear In Table 3» 

The response rates achieved for the various CBPOs were also 
calculated. They were generally high. Of the 128 CBPOs involved, 
111 achieved response rates of 60 percent or better. A list of the 
CBPOs from whom cooperation was requested is given in Table 4 with 
the number of inventories mailed to the CBPO and the response rate 
achieved. 

Administration to Basic Trainees 

In order to obtain a "men-in-general" group, as well as to 
estimate the test-retest reliability of VOICE, the inventory was 
administered to a group of airmen who reported for experimental testing 
during basic training • VOICE was administered to 312 men in their 
sixth day of training. A sample o'' 211 from this same group took 
VOICE again on their twenty-ninth and last day of basic training to 
attempt to estimate the reliability of VOICE. From this total, 209 
valid cases were used as "men-in-general" group. Some of the men from 
the original group were lost because of discharges, setbacks to other 
flights, sick call on the test date, and various other reasons. Since 
it was difficult to know in advance exactly how many airmen would 
report for testing, every man in each flight was tested until the 200 
which the project directors had requested was reached. 

Data Transcription and Editing 

Useable answer sheets were batched (the unuseable ones wero not 
processed) and the responses transcribed on magnetic cape by means of 
a special purpose scoring machine called SCRIBE (Scanning, Comparing, 
Recording Instrument for Better Education). A quality control check 
indicated that the probability of more than one error per 100,000 
transcriptions was less than 0.01. Ail errors found were corrected. 

Data from the SCRIBE file were merged with the file obtained from 
the Personnel Research Division tape giving Airman Qualifying 
Examination (AQE) scores for the participating airmen. 
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Table 3 



Test Administration 



Useable Unuseable 
Returns Returns 



AFSC Number Reason Number 



2521 


457 


Incomplete 


121 


3040 


409 


Patterned Responses 


22 


4213 


361 


Wrong AFSC Number 


51 


4311 


364 


Completed by wrong SSAN 


1 


4730 


346 


Answer Sheet Number Unknown 


5 


O / 11 


10 / 


Returned Blank , no reason 


J/ 


7020 


385 


Claimed One Completed 


9 


8110 


315 


Separated or Discharged 


86 


Total 


3104 


Transferred or Reassigned 


25 






PCS 


33 






TDY 


11 






AWOL or Deserter Status 


4 






Patient Status 


2 






Military Confinement 


3 






On Leave 


1 






Addressee Unknown on Base 


22 






Total 


433 



GRAND TOTAL: 3537 
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SECTION IV 



Statistical Analysis 



Construction of Half-Samples 

Prior to analyzing job satisfaction data and developing scales, 
two half -Garaples were constructed. All airmen were grouped according 
to their career fields. Within each of the eight fields, each airman 
was ranked by his total AQE score. The total AQE score was the sum 
of the four uorraal deviates corresponding to the percentile scores 
usually reported for the scale. As a final step, each pair of airmen 
(in rank order) within each career field was considered and one of the 
two randomly assigned to Sample 1, the other to Sample 2. The result 
of this procedure was the formation of two half-samples of airmen in 
each of the eight career fields. The total AQE score was equal in 
expectation for half-samples in the same career field. The recruits 
forming the men- in-general group were also randomly divided. (This was 
a simple random division since no AQE scores were available for this 
group). Except where indicated, all of the analyses described were 
performed twice, once for each half-sample. 

Item Selection for A Priori Interest Scales 

To achieve more efficient scoring and eliminate heterogeneous 
items, a backward selection procedure was used to reduce the number 
of items forming an a priori scale. All a priori scales were limited 
to a maximum of 16 items. In the case of scales with fewer than 
16 items J the only items eliminated were those with negative correlations 
with the total scale score. The decision to use 16 items was based on 
the findings of Katz, Norris, and Halpern (1970), who achieved internal 
consistencies above 0.90 for interest items in eleven of twelve interest 
scales, using a similar item response format. 

Identical procedures were used independently with each half- 
sample and, subsequently, the combined sample, to select items for 
the recommended a priori scales. The procedure is summarized below: 

1. Compute the correlation between each item in the pool 
assigned to a scale and the total score based on the 
remaining items in the same pool. Items were scored 
on the basis of 1 = dislike, 2 = indifferent, 3 = like 
and each item was given an equal weight in computing 
the total scale score. 

2. Select the item with the lowest item-total correlation and 
discard that item. 

3. Rescore the scale total, eliminating the item identified 

in the previous step. Recompute new item-total correlations. 
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4. Repeat steps 2 and 3 until 16 items remain. 

5* Eliminate items with negative item-total correlations 
if fewer than 16 items were in the scale item pool. 

This process illustrates the logic of the scale construction. 
If this strategy were implemented on the computer, the several 
rescorings would be time consuming and expensive. Identical results 
can be obtained by manipulating the variance^covariance matrix of 
items. For each item p , the correlation with the total score based 
on the remaining items was computed by 



rp(t.p) = 



k 

( I 
d=l 



S .) 

P3 



k 

(( r 
i=l 



k 
3^P 



1/2 



(1) 



The quantity r^Ct.p) indicates the correlation of interet?t. 

The notation (t.p) .has been used in the subscript to show that the 
pth item has been removed from the total. The number of items is 
denoted by k , and the covariauce of items i and j is denoted by 
S. . . 

When the p th item was identified as having the lowest item-total 
correlation, correlations were formed by calculating 



k k k 

r i(t.pp') = ( S S ,.)/(( E E S..) (S , )^^^ (2) 

P d=l P J i=l d=l P P 

d?^p d^p 

d^p^ ±^p' d^p' 



and the pth item with the lowest value of r i (t.pp') was dropped. 
This process was continued until 16 items remained. 

Tlie process just described was designed to achieve a high degree 
of homogeneity among scale items. Internal consistency coefficients 
(coefficient alpha) were computed for each scale after item selection 
as follows: 

a = [1 - E sj/s2] 

X 
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where k represents the number of items In the scale, s. the Item 

2 ^ 
variance, and s^ the total scale variance for the k items. Whenever 

internal consistencies were calculated, as in the occupational keys and 
the job satisfaction scales, the same method was used. 

Ident if ying Satisfied and Dissatisfied Personnel 

A two-phase plan was initiated to assign individuals in each career 
field to satisfied and dissatisfied groups. First, within each field, 
satisfied and dissatisfied groups were formed according to their responses 
to the overall job satisfaction question. This item asked respondents 
to indicate their satisfaction with their Air Force job on a four point 
scale ranging from very satisfied to very dissatisfied. Personnel 
responding "very satisfied" or "moderately satisfied" were classed in 
the satisfied group, and those "moderately dissatisfied" or "very 
dissatisfied" were classed in the dissatisfied group. Second, a 
stepwise regression analysis (Draper and Smith, 1966, p. 171) was 
performed within each field to find the linear function of the four 
satisfaction scores which best separated the satisfied from the 
dissatisfied personnel. This linear function was then used to rank 
all personnel within a career field in terms of "satisfaction." The 
first subjects with the highest scores were chosen as the satisfied 

group where corresponded to the number of subjects indicating 

overall satisfaction. This resulted in the proportions of satisfied 
and dissatisfied personnel within each field being the same as those 
estimated from the overall question. 

Determining Occupational Scales 

Within each career field the zero order correlations between each 
interest item and a dichotomous criterion were computed. The 
dichotomous criterion was scored 1 if the subject was a member of the 
satisfied career field group and 0 for the comparison group, which 
was always men-in-general. The 50 items having the highest correlations 
(regardless of sign) with the group membership criterion were selected 
from the 400 items. These 50 items served as Independent variables In 
a stepwise regression analysis with the dichotomous criterion. This 
procedure was equivalent to a stepwise discriminant analysis for the 
two-group case (Beaton, 1964) and served the purpose of selecting a 
final set of items. Items were added until the Increment in the 
squared multiple correlation was less than 0.0025. The variables 
added up to this point constituted the occupational scale with unit 
weights, the weights arrived at through the stepwise regression 
analysis were retained for comparative purposes. It was necessary 
to reduce the search for items to comprise these scales from 400 to 50, 
because 400 independent variables is too large for most stepwise 
regression computer programs. Also, scales consisting of as few items 
as possible make scoring easier. It was believed that 50 items would 
provide a manageable pool and be large enough to produce occupational 
scales of sufficient quality. 
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Predicting CriCerion Groups with A Priori Scales 



A similar procedure was used to arrive at the best linear Tunc tion 
of a priori scale scores for separating each group of satisfied personnel 
from men-in-general within each career field. A stepwise regressic7n 
analysis was performed, using the dichotonious criterion representing 
group membership. Scales adding at least 0.0025 to the squared multiple 
correlation were retained. 

Multiple Group Discriminant Analysis 

Using the 13 final a priori scale scores, a 16 group discriminant 
analysis (Cooley and Lohnes, 1962, Chapter 6) was performed oa the 
eight satisfied and eight dissatisfied career groups. A siniilai 
sixteen-group analysis was also performed using the occupatioaal scaler.. 
The purpose of the discriminant analysis was, in part, to further 
validate the scales but also to check the positions of the various 
groups in the discriminant space. For example, the analysis would 
determine the proximity of the two groups from a similar cireer field 
in relation to groups from other career fields. Evidence of this 
nature would provide supplementary evidence as to the validity and 
usefulness of the scales developed. 

Cross-Validation 

The scoring weights derived from the series of stepwise regression 
analyses in one half-sample were applied to the data in the other half- 
sample. Within each field the appropriate occupational scale score or 
linear function of a priori scales was used to predict whether the , 
subject was classified as a member of the satisfied group or of the 
men- in-general group. Dissatisfied personnel were net included in the 
cross-validation. Clasirif ication rules which minimized the probability 
of misclassif ication were developed for each career. 

For each individual, the probability of membership in one of the 
two groups could be computed according to the formula (Cooley and T.ohnes, 
1962, p. 138): 



p.. [H. |y.] 



E — e 



2s, 



(4) 
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where P[Hj|y^] indicates the probability that the i th person with 

scores on the derived linear function of items or scales belongs 

to group IK . In this case, there are two groups and H2 

The p. is the proportion of satisfied personnel obtained in the 

cross-validation sample; the is the estimate of the standard 

deviation of the linear function or score estimated from the analysis 
sample. 

By taking the logarithm of the numerator of the quantity of the 
right side of the equation and substituting the sample quantities, 
the following classification rule is obtained: 



log 



NS 



(y - y^)^ 

2s2 



log 



N (y^ - y )^ 

in i m 



NS 



m 



2s' 



(5) 



where N is the number of subjects classified as satisfied in a 
c 

given career group in Sample 2; is the number of subjects in the 

men-in-generdl grouu in Sample 2; N - N + N : 3 and S are 

the standard deviations of Y estimated for the given career group 

and the men--in-general group, respectively, in Sample 1; and Y^ 

are the estimated means of the function Y in Sample 1 for the career 
group and men-in-general group, respectively. 

If C > 0 , a subject was classified in the career group. If 
C < 0 , the subject was classified in the men-in-general group. 

Hits and Misses were computed as follows: 

1. If C > 0 and the subject was in the career group, classify 
as a career hit . 

2. If C < 0 and the. subject was in the career group, classify 
as a career miss. 

3. If C > 0 and the subject was in the men- in- general group, 
classify as a men-in-general miss. 

4. If C < 0 and the subject was in the men- in- general group, . 
classify as a men-in-general hit. 
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For the occupational scales > the proportions of hits (correct 
classifications) obtained \^±th unit weighting were compared with the 
proportions of hits obtained using the exact weights estimated in the 
discriminant analysis. For the a priori scales, weights rounded off 
to integer values were compared to the exact weights. 

Obtaining Final Estimates 

Once the cross-validation was completed, data from the two half- 
samples were recorabined and final scales were derived on ths? basis of 
all the data. This Is, the steps performed for each half-sample were 
carried out for the entire sample. Satisfied and dissatisfied groups 
were defined, items for the a priori scales were selected, and 
occupational scales were developed using the entire sample of satisfied 
airmen within a career field as the criterion group and the entire 
men-in-general sample as the reference group. A final set of linear 
functions of the a priori scales were also developed in this wny. 
The cross-validity estimates can be regarded, tiien, as iov;or bound 
estimates for the error in the classification rules based on the 
entire sample, since they are based on only half the data. 

In addition, all means, standard deviations, and intercorrelations 
were computed for the final scales, and test-rutest correlations for 
all scales were computed on a subsample of 209 men-in-general. 
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SECTION V 



Results 

Characteristics of the Job Satisfaction Scales 

The four job satisfaction scales, Job, Peer, Supervision, and 
Air Force, appear to possess a sufficient degree of reliability and 
some discriminant validity, as can be seen in Table 5. This table 
gives the correlations between the four job satisfaction scales, as 
well as the correlation between the satisfaction scales and the single, 
overall job satisfaction item. The estiinates of internal consistency 
have been placed in parentheses. 

The estimates of internal consistency for the Job and Supervision 
scales were comparable to those reported for Smith's JDI (Robinson et al, 
1969, pp. 105-106), which were in excess of 0.80 for her five scales. 
Internal consistencies for the Peer and Air Force scales were 
considerably below that standard. Although the Air Force scale had 
only eight items and the lowest internal consistency, the Supervision 
scale had the highest internal consistency, even though the Job scale 
had more items. 

Judging from the magnitude of the correlations between scales, 
the four scales do not appear to be statistically independent. This 
result was also reported for the JDI. However, the high correlation 
reported for the JDI between Work and People was not found between 
the VOICE scales for Job and Peer, even though the two pairs of scales 
are similar. These positive correlations indicate that the VOICE 
job satipr action scales may be measuring a general job satisfaction 
factor a largd extent. 

The four correlations between each of the scales and the single, 
overall job satisfaction item indicate that each scale is positively 
associated with overall satisfaction. However, satisfaction with the 
intrinsic nature of the work activities which a career entails, as 
U reflected in the Job scale, is of major importance. 

Development of Rules for Classi f ying Satisfied Personnel 

Table 6 shows the multiple correlati^'ms for the total sample 
between scores on the job satisfaction scales and the single overall- 
satisfaction question. Within the eight careers the correlations 
ranged from 0.68 to 0.60. The standardized regression weights for the 
four scales indicate, with a high degree of consistency, that the 
score on the Job scale is of paramount importance in predicting how 
an airman responds to the overall job satisfaction question. When 
t-tests of the significance of standardized regression weights are 
performed, they show that in most instances the contributions of the 
remaining scales were not significant. Inspection of Table 6 also 
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Table 5 



Satisfaction Scale Reliabilities 
And Intercorrelations 



Scale 


Job 


Peer 


Supervision 


Air Force 


Overall 


Job 


(•840) 


.348 


.466 


.454 


.722 


Peer 




(•742) 


.476 


.351 


.260 


Supervision 






(.884) 


.379 


.343 


Air Force 








(.597) 


.373 



Internal consistency coefficients are shown in parentheses. The 
correlations were based on the total sample of 3,072 airmen. 
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reveals Lliat Llio regression systems for the eight: careers were higlily 
similar (in that weiglits for Job were high) , though the proportions 
of satisfied personnel differed within each field. Table 7 shows 
the regression weights and multiple correlations for the half-samples. 

Total job satisfaction scores for each career field were generated 
using these regression systems. Groups of satisfied and dissatisfied 
personnel were distinguished by cutting scores determined by the 
proportions of satisfied and dissatisfied responses to the overall 
satisfaction question. Table 8 shows the numbers of satisfied and 
dissatisfied personnel Ln each career, along with the proportion of 
men in each field who were satisfied with their jobs. 

Since only satisfied personnel were to be used in developing 
scales, a desirable outcome would have been for substantial numbers 
of men within each career to have indicated satisfaction with their 
work. Unfortunately, in five of the eight careers fewer than half 
the men indicated satisfaction with their jobs. Within Security 
Specialist only 24.4 percent were satisfied, which reduced the total 
sample of satisfied personnel within this field to 43 in Sample 1 
and 32 in Sample 2. This suggests that an initial screening to 
identify satisfied personnel might have been useful. VOICE could 
have been administered to airmen identified as satisfied and to a 
sav:\ple of those who were dissatisfied. 

Selection of Items for tlie A Priori Scales 

The items for the VOICE a priori scales were selected independently 
for each Iialf-saraple, according to the statistical procedures described 
earlier. Since the objective was to construct homogeneous scales, 
the responses of every airman with a valid AFSC were used to choose 
the scale items regard Less of the degree of job satisfaction the men 
expressed. The result Lny; sample sizes were 1,^^37 for Sample 1 and 
1,535 for Sample 2. 

The itr .s selected in the half-samples were relatively consistent. 
In six of the thirteen scales, identical items weu cliosen. Five 
scales had fewer than four items that were unique for half-samples. ^ 
Only tiie Scientific anci Outdoor scales were substantially different. 

The differences in the Scientific scales of the two samples can 
probably be atLrLhuted to- their orientation. The eleven items unique 
to tlvj Sample 1 scale ran be associated with academic activities; for 
example, reading about y^rcut sclent LsLs and studying astronomy, 
chemistry, meteorology, microscopes, nuclear reactions, pliysics, and 
radiation belts. Tliese are all activities and courses likely to 
take place in scliools and colleges. On the otiier Iiand, the items 
unique to Sample 2 represent a technical and operational orientation — 
[)er form] n)', c?xpuriments , ch.'termlning concentrations, helping scientists, 
devising', special etju i piiient , determining the age of foods, keeping 
records, classifying rocks, and using microscopes. 
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As for the two Outdoor scales, the activities of Sample 1 are 
chiefly leisure activities; those of Sample 2 represent occupations. 
The Sample 1 scale included canoeing, hunting, sailing, fishing, camping, 
playing Softball, picnicing, and riding trail bikes. The Sample 2 
occupations and activities are performed outdoors: longshoremen, 
lineman, lumberjack, mason, surveyor, mowing lawns, pouring concrete, 
planting trees, and roofing. 

Of course, these conclusions are speculative, but they do suggest 
considerations for further development of the a priori scales. The 
factor structure of the inventory and its relation to the a priori 
scales appear to be important. Since the a priori scales served as a 
model for constructing items, analysis of the factor structure of the 
total inventory would serve as a test of the a priori structure and 
would possibly provide alternative formulations for broad area scales 
like the a priori scales. 

The items selected for the a priori scales, on the basis of 
Sample 1 combined with Sample 2, are given in Table Al in Appendix A. 
In the Scientific scale, most of the competing items came from Sample 2; 
thus, the scale is oriented toward technical aspects of scientific 
endeavor. The Outdoor scale contains predominantly items indicating 
outdoor occupations , although there are a significant number related 
to outdoor leisure activity. 

As indicated previously, the a priori scales were constructed 
with the aim of selecting items that were homogeneous. Internal 
consistency coefficients were calculated, using coefficient alpha, 
for each scale in each half-sample and the combined sample. These 
coefficients are presented in Table 9. The Sample 2 coefficients were 
larger than those obtained for Sample 1 for each scale; the coefficients 
for the combined sample were between those obtained for Samples 1 and 2. 
The coefficients obtained were less than those obtained by Katz et al 
(1970, p. 38), who achieved internal consistencies above 0.90 for 
10 to 12 scales, but they were approximately the same as those presented 
in the examiner manual for the Kuder Preference Record-*Vocational 
(Kuder, 1956, p. 21). 

Selection of A Priori Scales for Cross-Validation 

In using a priori scales to identify the occupational interests 
of recruits that will be similar to those of airmen satisfied with 
their careers, it is desirable to use fewer than the 13 scales. 
Moreover, it can be assumed that satisfaction in each career field 
depends upon varying combinations of interests. For example, one would 
expect interest in mechanics to be essential to satisfaction with the 
work of a General Purpose Vehicle Repairman (AFSC 473X0); one would not 
expect interest in mechanics to be a factor in satisfaction with vjork 
as an Administration Specialist (AFSC 702X0). 



Table 9 



Internal Consistencies for the A 
The Two Half Samples and the 


. Priori Scales 
Combined Sample 


for 






Internal 


Cnnsistency^ 

— 




Scale 


Sample 1 




Sample 2 


Combined 


Audiographic 


.8914 




.8983 


.8949 


Food Service 


.8907 




.8974 


.8941 


Pedagogy 


.8906 




.9007 


.8957 


M- Scale 


.8131 




.8236 


.8167 


Leadership 


.8458 




.8515 


.8486 


Computational 


,9255 




.9317 


.9274 


Health Service 


.8985 




.9007 


.8996 


Scientific 


.9089 




.9293 


.9279 


Electronic 


.9404 




.9445 


.9425 


Outdoors 


.8230 




. 8935 


. 8711 


Mechanics 


.9208 




.9233 


.9227 


Clerical 


.8955 




.9006 


.8988 


Academic 


.8811 




.8932 


.8921 



Measures of internal consistency obtained by calculating coefficient alpha, 




where k is the number of items in the scale; s, is the item variann ; 

1 

9 

and s" is the total variance. 
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In order to duterinine which a priori scales to use for thv. cross- 
validation and what weights to apply to them, a stepwise regression 
analysis was performed for the scales developed for each half-sample* 
The dependent variable in these analyses was either satisfaction with 
a career field or membership in the men-Ln-general group. Analyses 
were performed independently for each field and each half-sample . 
Scales were included until the increase in the squared multiple 
correlation was less than 0.0025. The resulting scales for each career 
field in each half-sample, the combined sample, and their weights 
appear in Tables A2 through A13 in Appendix A. A negative weight 
in the tables indicates that individuals in the career field scored 
higher than individuals in the men-in-general group. Standardized 
regression weights , multiple correlations, and standard errors are 
also given in these tables. 

The scales indicated by the analysis to reflect satisfaction with 
a given career field are logical. For example. Mechanics was the first 
scale, and thus the scale with the highest zero order correlation with 
the criterion, selected among men in the two careers in the Mechanics 
AQE requirement group (Aircraft Maintenance Specialist and General 
Purpose Vehicle Repairman). Similarly, the Electronic scale was the 
first scale selected among personnel serving as Radio Relay Equipment 
Repairman and Ground Equipment Repairman, the Computational scale among 
Accounting Specialists, and the Clerical scale among the Administration 
Specialists. 

Selection of Items for the Occupational Scale 

Occupational scales were developed for each of the eight careers. 
They were developed independently in each half-sample for cross-validation 
and in the combined sample for the recommended occupational scale. 
The stepwise regression technique used was equivalent to a discriminant 
analysis, which gives weights to independent variables (items in this 
case) so membership in one of two groups can be predicted with a minimum 
of error. The two groups considered were satisfied airmen in the 
career field for the scale being constructed and the recruits serving 
as men-in-general. 

The number of items selected for each scale, multiple correlation, 
and number of satisfied airmen in each career field are presented in 
Table 10 for each half-sample. Few items were selected for both 
Samples 1 and 2 — no scale had more than ten. A detailed list of the 
items selected, along with the standardized regression weight, the 
regression weights and their standard errors appear in Tables AlO 
through A18 in Appendix A. Negative regression weights indicate an item 
was preferred by the raen-in-general group. Thus, for the Weather 
Observer scale based on combined samples, satisfied members of the 
career field responded more favorably than men-in-general to studying 
meteorology, planting and taking care of a vegetable garden, studying 
calculus, making weather forecasts, visiting a museum, helping write 



Table 10 



Number of Items, Multiple Correlation, and Number of 
Satisfied Airmen in the Career Field 

SAMPLE 1 

Number 



Occupational Scale 


Items 


Multiple r 


Satisfied 


Weather Observer 


19 


.7529 


lOS 


Radio Relay Repairman 


29 


. 7914 


111 


Hround Equipment Repairman 


28 


.7291 


71 


Aircraft Maintenance 


29 


.7090 


98 


Vehicle Repairman 


27 


.7640 


92 


Accounting Specialist 


17 


.8289 


126 


Administration Specialist 


15 


.7108 


87 


Security Specialist 


38 

SAMPLE 2 


.7369 


43 


Occupational Scale 


Items 


Multiple r 


Number 
Satisfied 


Weather Obse:rver 


22 


.7649 


117 


Radio Relay Repairman 


21 


.7524 


115 


Ground Equipment Repairman 


24 


.7148 


72 


Aircraft Maintenance 


24 


.6385 


92 


Vehicle Repairman 


23 


.7723 


93 


Accounting Specialist 


38 


.8151 


132 


Administration Specialist 


23 


.6781 


94 


Security Specialist 


27 


.7520 


32 
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questions for a test, writing a computer prograin, learning; to n/ivigate 
a boat, being a teacher, drawing blueprints for a bridge, and solving 
geometry problems. Compared to men-*in-general , they tended to dislike 
marching in a parade, watching.; draj; racing, organising a military drill 
team, installing a lelepi^one, fighting a fire, and constructing 
ma theraat ical tab les • 

Two sets of weights were retained for the half-sample analyses 
and cross-validation. One set was the actual regression weight; the 
others were plus or minus one, depending on the sign of the regression 
weight. 

Means and Standard Deviations 

Means and standard deviations for all final scales were computed 
for each career field and men-in-general and are presented in Table A19 
in Appendix A, All scale scores were first converted to a scale with a 
mean of 50 and a standard deviation of 10 for the entire sample of 
3,072 airmen in the eight career fields. The men-in-general data were 
not used in obtaining the conversion parameters, but means and standard 
deviations for the men- in-general are reported on the converted scale. 

For the most part, the men-in-general means and standard deviations 
closely approximate the overall values of 50 to 10, The largest 
departure from the mean of 50 occurred for the M-Scale, This is not 
surprising, since this particular scale entered early into the stepwise 
regression systems with more consistency than any other a priori scale. 
One might speculate that the difference is in large part the result of 
a shift in mean scores, which would occur as a recruit gains experience 
with military life. To put it another way, many of the activities or 
careers described in the M-Scale may lose some of their appeal once they 
are more directly encountered by an individual. Interestingly, satisfied 
personnel within all eight career fields had higher mean scores on both 
the M-Scale and the Leadership Scale than did their dissatisfied 
counterparts. Other a priori scales exhibited predictable patterns 
with high mean scores for satisfied personnel within the satisfied 
career fields logically related to a particular scale. Satisfied 
Accounting Specialists, for example, obtained a mean of 57,774 on the 
Computational scale versus a mean of 45,920 for dissatisfied Accounting 
Specialists and 48.082 for men- in-general. Similarly, satisfied Radio 
Relay Repairmen obtained a mean score of 57,288 on the Electronics scale 
versus 52.511 for chose dissatisfied within the same career field and 
49,546 for men-in-general. 

The occupational scale means exhibit a different but expected 
pattern. The highest means, of course, are obtained by the satisfied 
individuals in the career for which the scale was developed, the lowest 
for men-in-general, with the difference in means between these two 
groups averaging about one and one-half standard deviations. The 
dissatisfied personnel within the same career fields, however, achieved 
means almost as high as the satisfied personnel* A clear implication, 
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on the basis of the occupaLional scales, is that almost as many 
dissatisfied personnel would be placed in theLr present career fields 
; I s wo ul d s a t is f 1 c d a± rme n . 

One factor whicli may, in part, account for this result is the 
explicit selection which took place when the career personnel involved 
Ln the present study were originally assigned to their respective 
fields. Ln order to be assigned to the Weather Observer field, for 
example, a man had to have an AQE General score at or above the 
HOth percentile. Thus, the Weather Observer group was subjected to 
explicit select Ion on (Juneral AQE and to incidental selection on ail 
variables eorrelated with General AQE. An item migiit reveal differences 
between the men-in-gene ral group and the satisfied career groups merely 
because the Item was subject to the effects of incidental selection. 
I t most items in a scale fell into this category, one would expect 
I lie results to be fairly close to the results shoy>^n in Table_A^^ 
It can be seen tliat tlie a priori scales which were not subjected l.o 
incidental selection during tiieir construction exliibit a different 
pattern, with larger mean differences between satisfied and dissatisfied 
groups on the scales logically related to a given career group.. Judging 
from the data presented in Table A19, use of the a priori scales on a 
purely logical basis would probably result in fewer dissatisfied 
[>ersonuel being reassigned to the same career. 

The method which employs a men-in-general versus a career criterion 
group has been used as the primary technique in keying occupational 
inventories. Clark (1961) states ^*...a scoring key will be considered 
good if it does a good job of separating workers in a given occupation 
from workers in general." Campl^ell (1971) comments on the Strong 
Vocational Interest Blank: '*The main purpose of men-in-gerieral is to 
establish the general level of popularity of an item; this can then 
be compared with the rate of endorsement of the occupational sample to 
locate items that tlie members of the criterion sample answer differently 
from tlie reference sample.*' 

In Lhu- reeommendations section, several possible alternatives to 
the Lradllioiial use of a men- in-general group are suggested. 

Correlations among the Scales 

(Correlations among the various scales are given in Table 11. Those 
for the occupational scales, in the upper portion of the triangular 
matrix, arc^ gene?rally low, with the possible exception of a clustering 
t)f the career I'Lelds with Electrical and Mechanical AQE requirements. 
The interi'Ml correlations for these four careers tend to be moderately 
high, although considerably lower than those presented by Campbell 
(1^)71, pp. 'H)~4l) for occupations of a similar nature. 

CorreLations among the a priori scales are given in the right 
coriier of tiK' Lriangular matrix. Each correlation is positive, in 
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contrast to that found by Clark (1961, p. 65) and Kuder (1956, p. 21) 
and similar to that obtained by Katz et al (1970, p. 33). The fact 
that some inventories report predominantly positive intercorrelations , 
while others report intercorrelations that are mixed in sign, most 
likely represents the differences in item format used. If a forced- 
choice format is used, as in Clark and Kuder, correlations of a mixed 
nature are likely to result. On the other hand, if there is no 
constraint on responding to an item, a general willingness to respond 
either favorably or unfavorably will be indicated by the positive 
correlations. 

In order to get some notion of the structure of the relationship 
among the a priori scales, scales having intercorrelations greater 
than 0.60 were identified. Similarly, scales correlating between 
0.55 and 0.60 were noted. The interrelationships of the scales were 
represented as a network, with the scales serving as points and the 
correlations above 0.60 as arcs forming the network (Figure 1). One 
can see tliree clusters forming. One represents Electronic-Mech^niics- 
Outdoors-M-Scale activities, a second represents the remaining scales 
except for Food Service, which stands alone even though it relates 
somewhat to the scales in the second cluster. 

The purpose of presenting data in this form is to provide some 
suggestions for modifying the a priori scales in subsequent studies. 
It should be emphasized that the subject of this report is the initial 
phase in the development of an effective vocational interest inventory. 
The primary purpose of this phase was to develop an item pool, field 
test the pool, and construct experimental scales. Subsequent efforts 
should be aimed at modifying the experimental scales, adding new scales, 
and deleting redundant or misleading scales. Examination of Figure 1 
may suggest modifications of the a priori scales. For example, the 
Scientific and Pedagogy scales each have correlations of 0.60 or more 
with five other scales. Further, four of the five high correlations 
are common for both scales. This suggests that it might be profitable 
to attempt to combine these scales. 

Correlations between the a priori scales and the occupational 
scales appear in the lower left corner of the matrix in Table 11. 
These correlations seem logical, in that scales for career fields 
that are mechanical in nature correlate highly with mechanical 
interest. The same holds generally for Electronic and Clerical 
career fields. 

Test-Retest Reliability Estimation for the Scales 

In order to estimate the reliability of the scales obtained from 
the combined sample, scores on the scales obtained from the two 
administrations of VOICE to the subsample of 209 men-in-general were 
correlated. It should he recalled that these men were in basic training 
at the time of both administrations. 
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The correlations were low in comparison to those for other interest 
scales. Correlations ranged from 0.35 to 0.55 for the occupational scales, 
with virtually all scales having correlations within 0.05 of 0.50. 
Correlations for the a priori scales were higher and ranged from 0.54 to 
0.74. Due to the fact that these correla ions were considerably lower 
than those obtained by both Strong on the Vocational Interest Blank and 
Clark on the Minnesota Vocational Interest Inventory (they consistently 
obtained test-retest correlations of 0.80 or higher for their scales) 
and lower than one might expect by examining the internal consistency 
estimates, the validity of these correlations as reliability estimates 
must be questioned. 

In obtaining estimates of reliability for an instrumenL administered 
on two separate occasions, the instrtiment must be administered 
independently and under identical conditions. In theoretical terms, 
this means that the distributions must be identical for each trial; that 
is, the expected value or mean of the measurement must be the same for 
both administrations. Further, the variances must be the same for both 
administrations. For detailed discussion of conditions necessary for a 
test-retest correlation to serve as an estimate of reliability, the 
reader is referred to Lord and Novick (1968, Sections 3.3 and 3.4). 

To determine whether the correlations obtained were valid estimates 
of reliability, thci conditions were examined to see how appropriate they 
were. The differences between the raw score means and raw score variances 
for each trial were obtained and are presented for each scale in Table 12. 
In addition, an F-test was performed to test the equality of the raw 
score variances, and a t-test was performed to assess the significance 
of the difference in means where the variances were concluded to be 
identical. The significance level used was the 0.05 level, with a "yes" 
entry in Table 12 indicating significance. 

Certain features of the data stand out. For example, all the 
differences in the means have the same sign for each type scale. 
Although this result is difficult to interpret for the occupational 
scale, since both positive and negative items weights comprise a scale, 
an interpretation can be made for a priori scales. Apparently, there 
was a greater tendency to dislike items on the second administration, 
which was reflected in the consistently positive mean differences. 
Also, with one exception, the raw score item variances for the first 
administration were larger than those obtained on the second administration. 

In examining the significance tests, six of 21 tests for equality 
of variance were significant. Eight of 15 tests for zero mean differences 
were significant. Only seven scales failed to achieve significant 
differences in either raw score mean or variance. These results, taken 
simultaneously, indicate that the conditions, which were necessary to 
validate the correlations obtained as accurate reliability estimates, 
were generally not satisfied. Thus, the obtained correlations cannot be 
used as reliability estimates. 
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Further evidence of the lack of parallelism between the two VOICE 
administrations appear in Table 13, which gives the correlations among 
the a priori scales for the two administrations. The upper-right 
triangular matrix represents correlations obtained for the first 
administration, the lower-left triangular matrix represents correlations 
obtained for the second administration. If one expected the conditions 
of the t\-jo administrations to produce equal scores and variances for 
both administrations, one would expect the correlations among the scales 
to be equal, at least to the extent achieved in the previous samples. 
This did not happen. The correlations for the second administration were 
consistently higher than those of the first, indicating more consistent 
responses to all items in the second administration. Since the 
correlations differed considerably, one must conclude that reliability 
was not being measured. 

The correlations obtained may be of some use though. If the means 
vary with time, as the data suggest, the correlations are known to under- 
estimate the true reliability (Cochran, 1970). 

The actual correlations obtained for each scale have not been 
presented since such a presentation might result in an inaccurate 
condemnation of the scale reliabilities. Estimation of scale reliabilities 
should be performed in the future under rigid conditions so that the 
estimates may be reported. 

Cross-Validation 

The percentages of correct classifications (hits) within each career 
field, using each of four different methods, are presented separately 
for each half-sample in Table 14. Two base rates are shown (at the 
far right) for comparative purposes. Base 1 is a ^'maximum blind 
strategy" in that it will yield the highest possible percentage of 
expected hits with the absence of any information that might relate 
individuals to groups. Under such conditions, classifying all 
individuals in the larger of the two groups will achieve the most hits. 
Base 2 represents proportional random assignment — a less than optimal 
strategy but one that might be employed in a situation where no useful 
information about individuals is available, but quotas must be met. 
Under this system, individuals are assigned at random to one of two 
groups until a quota is filled for one of the groups. The same 
classification proportions or probabilities were used to compute the 
base rates as were used in the classification functions. 

Except for Security Specialist all of the methods bettered the base 
rates with some consistency in both samples. Since so few individuals 
were classified as satisfied within Security Specialist, it is not 
surprising that the cross-validations yielded unimpressive results in 
this field. As Table 14 shows, the occupational scales appeared to be 
superior to the a priori scales for classifying individuals accurately. 
Nevertheless, Table 14 indicates that both types of scales possess 
considerable value for classification purposes. 
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Tliough there was virtually no difference between the unit weights 
and exact regression weights for the occupational scales in Sample 1, 
there appeared to be a slight difference in favor of the regression 
weights in Sample 2 when the total percentages of correct hits were 
considered. There appeared to be little difference between the 
regression weights and the integer weights when the a priori scales 
were used to classify individuals. 

Table 15 shows the percent of correct classifications within career 
groups and men-in-general groups. Since the decision rules used for 
classification were aimed at minimizing the total number of errors, 
there tended to be some overassignment to the larger men-in-general 
group. It is important to recognize that the within-group hits do not 
imply that the interest scales developed are better for the men-in-general 
group. Had the satisfied career groups been larger than men- in- general , 
the overassignment would have been to the career groups. 

Discriminant Analyses 

The first discriminant analysis with 16 groups and the a priori 
scale scores as independent variables extracted 13 latent roots and 13 
corresponding discriminant functions. In Appendix A, Table A20 
presents the 13 vectors, or orthogonal linear functions, of the a priori 
scales. Table A21 in Appendix A presents the '^centroids" of each group 
on each discriminant function. 

Figure 2 presents the configuration of the 16 groups in the two 
dimensions, defined by the first two discriminant functions. The first 
discriminant function is characterized by a relatively large positive 
weight for the Computational scale and relatively large negative weights 
for the Electronic and Mechanics scales. The two Accounting Specialist 
groups had the highest negative positions on this discriminant function; 
the highest positive values were accorded the two Vehicle Repairman 
groups. The factor underlying the first dimension might be interpreted 
as a bipolar factor, with one end characterized by high interest in 
Computational tasks and low interest in Mechanics and Electronics 
and the other end characterized by high interest in Mechanics and 
Electronics and low interest in Computational tasks. 

The second discriminant function can also be interpreted as a 
bipolar factor. One end is characterized by high scores on Electronics 
and Science and low scores on Clerical, and the other end is characterized 
by the reverse pattern. The most extreme groups are the satisfied 
Radio Relay Repairman group and the satisfied Administration Specialists. 

In terms of the distances between groups on the first two 
discriminant functions, the satisfied and dissatisfied groups within 
a career field are relatively close. This suggests that the interest 
patterns ji the two groups within a field are more similar than for 
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■ - Satisfied 
Dissatisfied 

1 - Weather Observer 

2 - Radio Relay Repairman 

3 - Ground Equipment Repairman 
U - Aircraft Maintenance 

$ - Vehicle Repairman 

6 - Accounting Specialist 

7 « Administration Specialist 

8 - Security Specialist 
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groups in different fields, which is to be expected. Some exceptions 
do occur, however. For example, the satisfied Ground Equipment Repairman 
and dissatisfied Vehicle Repairman fall into a fairly tight cluster with 
the two Aircraft Maintenance groups. 

The second discriminant analysis was performed with the 16 groups 
again serving as dependent variables and the eight occupational scales 
serving as the independent variables. Tables A22 and A23 in Appendix A 
present the eight latent vectors derived and the centroids of each group 
on each discriminant function. The first two latent vectors from this 
analysis can also be interpreted as bipolar factors with each having one 
high positive and one high negative weight corresponding to a scale. 
As Figure 3 shows, the configuration of the 16 groups on the first two 
discriminant functions is quite similar to that in Figure 2. 

Conclusion 

Our statistical analyses indicate that the experimental inventory 
possesses considerable utility for distinguishing among career groups 
and for distinguishing between satisfied and dissatisfied personnel 
within career fields and a men-in-general group. Based on the results, 
it is estimated that an inventory of 246 items would be required to 
construct all scales in final form. The sample sizes used to develop 
the occupational scales were minimal at best and in some cases, 
inadequate. Campbell (1971) suggested using two independent samples of 
200, stressing homogeneity and making sure that men who perform the 
occupation in some unusual form are not included. The sample sizes 
used to develop the final occupational scales for the eight career fields 
of VOICE ranged from 75 for Security Specialist to 258 for Accounting 
Specialist. The cross-validation results which were based on only half 
the number used for the final scales reflect the differences in sample 
sizes in the different careers. In fields where large samples were 
available, the occupational scales generally crossed better than in 
fields where smaller sample sizes were available. The cross-validity 
estimates are, of course, lower bound estimates of validity since they 
are based on only half the sample. 
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SECTION VI 



Recommendations 

For most of the Air Force career fields Included in the field 
test of VOICE, the validity of the occupational and a priori scales as 
measures of recruits' vocational interests has been demonstrated. The 
evidence presented in this report indicates that by means of these 
scales it is possible to classify men with considerable accuracy in 
either an occupational or basic recruit group. 

However, a number of problems have come to light which require 
solution prior to the operational use of VOICE. Among these problems 
are the similarity of the interest patterns observed in dissatisfied 
and satisfied groups within a career, the rather small criterion 
groups which were available, and the apparent shift in the interest 
patterns of basic recruits over a period of three weeks. Although 
a final set of items is recommended for the inventory, it is suggested 
that the following additional research be undertaken. 

1 . Conduct a Longitudinal Study of Interest 

In this developmental effort, the scales of the inventory 
were validated using a standard cross-validation technique. 
Although this method of validating measures is worthwhile, 
it falls to examine the facet of validity that deals with 
prediction. In other words, do the interest scales, or 
scores, obtained early in an airman's service predict his 
subsequent satisfaction with his Air Force career field? 
This is an essential aspect of validity that must be confirmed 
prior to the adoption of any measure of vocational interest. 

Associated with this problem is the question of whether 
interests change as a result of experience in a career. It 
should be recalled that the mean scores, produced in this 
field test for the occupational scales, showed that airmen 
dissatisfied with their careers were closer in their interests 
to satisfied airmen in the same career than they were to 
men- in-general. One reason for this may be that airmen who 
are dissatisfied with a career field do, nevertheless, acquire 
some interests in common with those of men who like the field. 
Knowledge of the job, alone, may cause this. The validity of 
this hypothesis needs to be examined. 

Also, a longitudinal study would allow th^ examination of 
the, stability of many items comprising the M-scale. For 
example, two items, '^March in a parade** and "Take part in 
a military drill," appear in a number of scales. These items 
appeared to identify satisfied personnel, in that men 
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satisfied with their careers tended to dislike these 
activities more than men-in-general. However, this may 
have been due to the fact that the men- in-general had not 
been extensively exposed to such activities. After some 
experience In military service, recruits' Interest in these 
activities may decline regardless of their satisfaction with 
their careers. By identifying' items such as these, which 
may correlate vrith time and thus be useless for prediction, 
the validity of the scales would be increased. 

Such a longitudinal study could be conducted by following 
up the men- in-general group used in this field test. Repeated 
annual administration of VOICE to these men would provide an 
initial group, to be followed by subsequent groups of new 
recruits until a sample size is reached that would allow 
significant longitudinal inference. 

Expand the Men- In-General Group 

Closely associated with the longitudinal study cpuld be the 
establishment of accurate statistical data for men- in-general. 
The importance of obtaining sound estimates for the distributions 
of responses cannot be overstated. The men- in-general group 
is used in either the development or validation of all scales 
for VOICE. 

Strong (1954, Chapter 21) illustrates the effects of different 
men-in-general groups on correlations, scoring weights, and 
occupational clusters. His first men- in-general sample 
consisted of a few thousand men he happened to test during 
development of the Vocational Interest Blank. He modified 
this sample by using various U. S. Bureau of the Census 
statistics on employed men. Since then, several revisions 
of the men- in-general group have taken place. The group 
developed in 1969 includes some 1,000 men, stratified by 
general occupational area and specific occupation (Camipbell, 
1971, pp. 398-399, for a detailed listing of occupations). 

The statistical standards of the men-in-general group used to 
test VOICE do not compare to Strong. We urge that the group 
be revised until it possesses sufficient quality. Under 
simple random sampling, a sample size of 625 airmen ensures 
that the standard deviation of a proportion will be less than 
0.02. 

The precision could undoubtedly be increased by using 

stratified sampling. Men-in-general could be stratified 

by numerous variables (AQE score range, career field, and 

biographical information are possibilities) in conjunction 

with other studies of their interest. There is no doubt 

that scales developed with such a sample would be more accurate. 



3. Test the Factor Structure of the A Priori Scales 



As stated in the section describing the development of the 
a priori scales, an underlying structure was hypothesized for 
the inventory and items were written accordingly. Whether 
that structure was, in actuality, realized is open to question. 
A factor analysis of the items in VOICE would serve to answer 
the question. The analysis could be conducted immediately 
without collecting additional data. 

The results of a factor study might indicate that the 
a priori scales shovild be modified. For example, the factor 
analysis might suggest that the Pedagogy and Scientific scales 
be combined or that the Outrioor scale be divided into two 
separate scales. 

4. Revise the Job Satisfaction Scale 

The evidence gathered indicates that the nature of the work 
itself is of overwhelming importance in determining how 
satisfied an airman is with his career. Rather than include 
scales which attempt to measure other factors, it is suggested 
that the job content scale be made more reliable by adding 
items and performing an item selection such as that performed 
for the a priori scales in the present study. 

5. Investigate the Use of Continuous Criterion for Prediction 

An alternative to a two group criterion should be investigated. 
A multiple regression approach with a continuous criterion 
would seem to have the most promise. It would develop 
inventory scales against a continuous criterion of satisfaction. 
An individual could then be classified on the basis of a 
combination of manpower requirements in various fields and 
the satisfaction scores predicted for him in each of several 
fields. 

6. Examine Methodological Problems 

A number of methodological questions surround the traditional 
procedures used to key interest inventories. The effects of 
explicit selection on AQE scores are not clear; i.e., men are 
eligible for certain careers only within a given AQE score 
range. For example, it would not be necessary to use the 
Weather 0>>server*s scale for anyone with an AQE General score 
of less than the minimum required score of 80, Thus, 
discrimination should be only between satisfied Weather 
Observers and those men-in-general whose AQE score qualifies 
them. Such explicit selection could have considerable impact, 
particularly when the occupational scale approach is used. 
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Along the same line there are undoubtedly other selection 
factors producing self-selection. To what extent these factors 
affect the interest scales and decision rules arrived at is 
not clear. It may be possible to perform range-restriction 
correlations of some kind on the within-career personnel, 
using a men- in-general sample to achieve more useful results. 
A theoretical and an empirical approach to these questions 
is needed. 

7. Estimate Test-Retest Reliability 

The establishment of reasonable standards of measurement error 
for a psychological test is paramount in a developmental effort. 
When an instrument contains a significant degree of measurement 
error, its usefulness as an indicator of present status or a 
predictor of future behavior is questionable. In psychological 
measurement, measurement error is defined in terms of reliability 
coefficients and is frequently estimated by obtaining correlations 
between two administrations of a test. 

If a correlation between scores on two administrations is to 
serve as an estimate of reliability, the administrations must 
be conducted independently and under identical conditions. 
Furthermore, the distributions of the test scores for the two 
administrations must be identical. 

It is recommended that such a reliability study be conducted 
under strict control. The subjects for this study should not 
be in basic training, but rather in the field • This would make 
the assumption of equal variance more likely to occur in 
practice. The test should be administered under supervised 
conditions on both occasions and under conditions in which the 
men are at ease so that measurement of their interests can take 
place with minimum error. 

8. Establish Methodology for Differential Assignment 

No attempt was made to differentiate career interests among 
two or more of the various career fields at one time. The 
inventory may classify a man to an occupational area as opposed 
to men-in- general. However, in the actual recruiting situation, 
the man who could conceivably have interests identified in more 
than one career area must be assigned to only one area. In 
a situation such as this, the comparability of interest scores 
must be taken into consideration. To do this, an investigation 
should focus on interest scales which differentiate between a 
number of career fields being considered simultaneously. 

9 . Develop Additional Scales 

The occupations represented in the present study were necessarily 
limited. It is strongly recommended that a greater variety 
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of occupational scales than the eight in the present study 
be developed. It is suggested, however, that this step not 
be carried out until some of the questions and issues addressed 
in the preceding recommendations have been answered. 
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